DFKI-LT at the CLEF 2006 Multiple Language Question Answering Track
Abstract
The paper describes QUANTICO, a cross-language open-domain question answering system for German and English. The main features of the system are: preemptive off-line annotation of documents with syntactic information (chunk structures, apposition constructions, and abbreviation-extension pairs) for passage retrieval; online translation services, language models, and alignment methods for the cross-language scenarios; redundancy as an indicator of good answer candidates; and selection of the best answers based on distance metrics defined over graph representations. Depending on the question type, one of two answer extraction strategies is triggered: for factoid questions, answers are extracted from the best IR-matched passages and selected by their redundancy and their distance to the question keywords; for definition questions, answers are taken to be the most redundant normalized linguistic structures with an explanatory role (i.e., appositions and abbreviation extensions). In the CLEF evaluation, the best German-German run achieved an overall accuracy (ACC) of 42.33% and a mean reciprocal rank (MRR) of 0.45; the best English-German run, 32.98% (ACC) and 0.35 (MRR); and the German-English run, 17.89% (ACC) and 0.17 (MRR).

Categories and Subject Headings: H.3 [Information Storage and Retrieval]: H.3.1 Content Analysis and Indexing; H.3.3 Information Search and Retrieval; H.3.4 Systems and Software; I.7 [Document and Text Processing]: I.7.1 Document and Text Editing; I.7.2 Document Preparation; I.2 [Artificial Intelligence]: I.2.7 Natural Language Processing
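The accuracy (ACC) and mean reciprocal rank (MRR) figures quoted in the abstract can be illustrated with a minimal sketch of the standard definitions. It assumes that each question comes with a ranked answer list and that we know the rank of the first correct answer (or `None` if no returned answer is correct). The function names and the sample ranks are hypothetical, and the exact scoring protocol applied by CLEF may differ in detail.

```python
from typing import Optional, Sequence


def accuracy(first_correct_ranks: Sequence[Optional[int]]) -> float:
    """Fraction of questions whose top-ranked answer (rank 1) is correct."""
    if not first_correct_ranks:
        return 0.0
    hits = sum(1 for rank in first_correct_ranks if rank == 1)
    return hits / len(first_correct_ranks)


def mean_reciprocal_rank(first_correct_ranks: Sequence[Optional[int]]) -> float:
    """Average of 1/rank of the first correct answer; 0 for unanswered questions."""
    if not first_correct_ranks:
        return 0.0
    total = sum(1.0 / rank for rank in first_correct_ranks if rank is not None)
    return total / len(first_correct_ranks)


# Hypothetical data: rank of the first correct answer for five questions.
# None means no correct answer was returned for that question.
ranks = [1, 2, None, 1, 3]

print(f"ACC = {accuracy(ranks):.2%}")
print(f"MRR = {mean_reciprocal_rank(ranks):.2f}")
```

Under these definitions MRR can never be lower than ACC, because every question counted as a hit for ACC contributes a full 1 to the MRR sum, while correct answers at lower ranks add partial credit.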
Similar resources
DFKI's LT-lab at the CLEF 2005 Multiple Language Question Answering Track
This report describes the work done by the QA group of the Language Technology Lab at DFKI for the 2005 edition of the Cross-Language Evaluation Forum (CLEF). We describe the extensions made to our 2004 QA@CLEF German/English QA-system, especially the question-type driven selection of answer strategies. Furthermore, details concerning the processing of definition and temporal questions are desc...
A Cross-Language Question/Answering-System for German and English
This report describes the work done by the QA group of the Language Technology Lab at DFKI, for the 2003 edition of the Cross-Language Evaluation Forum (CLEF). We have participated in the new track “Multiple Language Question Answering (QAatCLEF)” that offers tasks to test monolingual and cross-language QA-systems. In particular we developed an open-domain bilingual QA-System for German source ...
DFKI-LT at QA@CLEF 2007
This working note briefly presents QUANTICO, a cross-language open-domain question answering system for German and English document collections. The main features of the system are: use of preemptive off-line document annotation with information like Named Entities, sentence boundaries and pronominal anaphora resolution; online extraction of abbreviation-extension pairs and appositional constru...
RACAI's Question Answering System at QA@CLEF 2007
This paper presents a pattern-based question answering system for the Romanian-Romanian task of the Multiple Language Question Answering (QA@CLEF) track of the CLEF 2007 campaign. We show that, working with a good Boolean search engine and using question-type-driven answer extraction heuristics, one can achieve acceptable results (30% overall accuracy) using simple, pattern-based techniques. ...
Experiments on Robust NL Question Interpretation and Multi-layered Document Annotation for a Cross-Language Question/Answering-System
This report describes the work done by the QA group of the Language Technology Lab at DFKI, for the 2004 edition of the Cross-Language Evaluation Forum (CLEF). Based on the experience we obtained through our participation at QA@Clef-2003 with our initial cross-lingual QA prototype system BiQue (cf. [NS03]), the focus of the system extension for this year’s task was a) on robust NL question inte...